A singing voice synthesis system based on sinusoidal modeling
نویسندگان
چکیده
Although sinusoidal models have been demonstrated to be capable of high-quality musical instrument synthesis [1], speech modi cation [2], and speech synthesis [3], little exploration of the application of these models to the synthesis of singing voice has been undertaken. In this paper, we propose a system framework similar to that employed in concatenation-based text-to-speech synthesizers, and describe its extension to the synthesis of singing voice. The power and exibility of the sinusoidal model used in the waveform synthesis portion of the system [1] enables high-quality, computationally-e cient synthesis and the incorporation of musical qualities such as vibrato and spectral tilt variation. Modeling of segmental phonetic characteristics is achieved by employing a \unit selection" procedure that selects sinusoidally-modeled segments from an inventory of singing voice data collected from a human vocalist. The system, called Lyricos, is capable of synthesizing very natural-sounding singing that maintains the characteristics and perceived identity of the analyzed vocalist.
منابع مشابه
Vibrato in Singing Voice: The Link between Source-Filter and Sinusoidal Models
The application of inverse filtering techniques for high-quality singing voice analysis/synthesis is discussed. In the context of source-filter models, inverse filtering provides a noninvasive method to extract the voice source, and thus to study voice quality. Although this approach is widely used in speech synthesis, this is not the case in singing voice. Several studies have proved that inve...
متن کاملSinging Voice Synthesis Combining Excitation plus Resonance and Sinusoidal plus Residual Models
This paper presents an approach to the modeling of the singing voice with a particular emphasis on the naturalness of the resulting synthetic voice. The underlying analysis/synthesis technique is based on the Spectral Modeling Synthesis (SMS) and a newly developed Excitation plus Resonance (EpR) model. With this approach a complete singing voice synthesizer is developed that generates a vocal m...
متن کاملPractical high-quality speech and voice synthesis using fixed frame rate ABS/OLA sinusoidal modeling
This paper describes algorithms developed to apply the Analysis-by-Synthesis/Overlap-Add (ABS/OLA) sinusoidal modeling system to real-time speech and singing voice synthesis. As originally proposed, the ABS/OLA system is limited to unidirectional timescaling, and relies on variable frame length to accomplish time-scale modification. For speech and voice synthesis applications, unidirectional ti...
متن کامل1 Concatenation - based MIDI - to - Singing Voice Synthesis
In this paper, we propose a system for synthesizing the human singing voice and the musical subtleties that accompany it. The system, Lyricos, employs a concatenation-based text-to-speech method to synthesize arbitrary lyrics in a given language. Using information contained in a regular MIDI le, the system chooses units, represented as sinusoidal waveform model parameters, from an inventory of ...
متن کاملConcatenation-based Midi-to-singing Voice Synthesis
In this paper, we propose a system for synthesizing the human singing voice and the musical subtleties that accompany it. The system, Lyricos, employs a concatenation-based text-to-speech method to synthesize arbitrary lyrics in a given language. Using information contained in a regular MIDI le, the system chooses units, represented as sinusoidal wave-form model parameters, from an inventory of...
متن کامل